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ABSTRACT 

Geographic routing with greedy relaying strategies have been widely 
studied as a routing scheme in sensor networks. These schemes as- 
sume that the nodes have perfect information about the location of 
the destination. When the distance between the source and des- 
tination is normalized to unity, the asymptotic routing delays in 
these schemes are Q( M , n *, ), where M(n) is the maximum dis- 
tance traveled in a single hop (transmission range of a radio). 

We consider three scenarios: (i) where nodes have location 
errors (imprecise GPS), (ii) where only coarse geographic infor- 
mation about the destination is available, such as the quadrant or 
half-plane in which the destination is located, and ( Hi) where only 
a small fraction of the nodes have routing information. In this pa- 
per, we show that even with such imprecise or limited destination- 
location information, the routing delays are €)( Af | n ) )■ We further 
show that routing delays of this magnitude can be obtained even if 
only a small fraction of the nodes have any location information, 
and other nodes simply forward the packet to a randomly chosen 
neighbor, and we validate our analysis with simulation. 

Finally, we consider the throughput-capacity of networks with 
progressive routing strategies that take packets closer to the desti- 
nation in every step, but not necessarily along a straight-line. Such 
a routing strategy could potentially lead to spatial "hot spots" in the 
network where many data flows intersect at a spatial region (a node 
or group of nodes), due to "sub-optimal" routes with increased 
path-lengths. In this paper, we show that the effect of hot spots 
due to progressive routing does not reduce the network throughput- 
capacity in an order sense. In other words, the throughput-capacity 
with progressive routing is order-wise the same as the maximum 
achievable throughput-capacity. 



1. INTRODUCTION 

The availability of cheap wireless technology and the emergence 
of micro-sensors based on MEMS technology will enable the ubiq- 
uitous deployment of sensor networks l24l fT1|5). Applications for 
sensor networks include robust communication, intrusion detec- 
tion and commercial applications involving macro-scale measure- 
ments and control. Such networks are characterized by the absence 
of any large-scale established infrastructure, and nodes cooperate 
by relaying packets to ensure that the packets reach their respective 
destinations. 

A popular routing algorithm for sensor network that has been 
widely studied is geographic routing 1121 1131 fill |p|. The main 
idea is to forward a packet to a node that is closer to the final 
destination than the current packet position (a greedy forwarding 
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Fig. 1. Random Routing 



strategy). When greedy forwarding fails (due to dead-ends or rout- 
ing loops), alternate routing methods such as perimeter routing, 
or route discovery based methods (using flooding) have been pro- 
posed nnnni. 

In this paper, we study the problem of geographic routing 
with limited or erroneous destination-location information. For 
instance, suppose that nodes only know the quadrant or the half- 
plane on which the final destination is. A node could then ran- 
domly forward the packet to an arbitrary node that is in that di- 
rection. As another example, suppose that nodes have the correct 
destination coordinates. However, the GPS at nodes are erroneous 
(and possibly biased), as a result of which packets are routed in the 
wrong direction. 

1.1. Main Contributions 

We consider a large-scale network where nodes are deployed over 
a unit region. Each node's maximum transmission range is scaled 
as M(n) = K\J (log n/n), for some K > 1. For K large enough, 
and n large enough, results in 1 8 23 1 ensure that straight line rout- 
ing (greedy geographic routing) is possible without recourse to 
face routing (the "loop-around" strategy employed when straight- 
line routing fails due to dead ends). 

We first consider the case where nodes have precise destina- 
tion coordinates. However, we assume that the GPS at nodes are 
imprecise. We model this by assuming that each routing step has 
an angular error 1 that is random. In other words, nodes attempt to 
perform greedy straight-line routing. However, due to the angular 
error, the packet is forwarded to a random node that is in some 
sector within angles (j>i and 4>2 (illustrated in Figure|3j- 
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1 Note that by expressing the position of a node in polar coordinates, the 
radial component of the error will not affect geographic routing; however 
the angular component could point in the wrong direction. Thus, we model 
GPS errors by randomness in the angular component. 



We then consider the case where nodes have limited destina- 
tion information. In particular, we consider the case where each 
node has only a coarse estimate - such as quadrant or half-plane 
information. In other words, each node has a coordinate system 
(a local notion of 'North') that need not be common to all nodes. 
All that each node knows is that the local quadrant in which the 
destination lies (or the half-plane in which the destination lies). In 
each of these cases, the routing strategy that is adopted is to simply 
forward the packet to a randomly selected node in the appropriate 
quadrant (or the half-plane). 

We also consider the case where only a small fraction of the 
nodes have any routing information at all. Most nodes simply for- 
ward the packet to a randomly selected neighbor. A small fraction 
of the nodes have quadrant information (as discussed earlier). This 
could be distributed by some gossip mechanism 1171 1 141 . where 
nodes forward routing information, but also forget this informa- 
tion after some time. We consider a simple model where a node 
has routing information with some fixed probability p 6 (0, 1), 
in which case, it routes to the appropriate quadrant and other-wise 
randomly routes the packet to an arbitrary neighbor. 

Finally, we consider the throughput-capacity in networks for 
the special case of progressive routing strategies where the pack- 
ets are transported closer to their destinations in each step, but not 
necessarily along a straight-line. Such a routing strategy could po- 
tentially lead to spatial "hot spots" in the network where many data 
flows intersect at a spatial region (a node or group of nodes), due 
to "sub-optimal" routes with increased path-lengths. For example, 
consider Figure0a)(i), where n source-destination pairs have non- 
intersecting paths (each of length '1'), thus resulting in a (normal- 
ized) throughput of ' 1' for each pair. On the other hand, let us now 
allow the path length between each source-destination pair to be no 
more than length '3' (due to imperfect routing). Then, in the worst- 
case, all the paths will share a bottle-neck node, thus decreasing 
the throughput per source-destination pair to be 1/n (illustrated in 
Figure0a)(ii)). With n randomly placed source-destination pairs 
and straight-line routing (see Figure0a)(iii)), it has been shown in 
1 8 1 that the throughput-capacity per source-destination pair scales 
as l/y nlog(n). Analogous to FigureQa)(ii), imperfect routing 
strategies, even when the path length is increased by only a con- 
stant factor (non-straight-line routing) could lead to hot-spots, thus 
decreasing throughput-capacity in an order-wise sense. In this pa- 
per, we show that the effect of hot spots due to progressive routing 
does not reduce the network throughput-capacity in an order sense. 
In other words, the throughput-capacity with progressive routing 
is order-wise the same as the maximum achievable throughput- 
capacity. 

The main contributions in this paper are the following: 

(i) We show that the time to reach the destination with erro- 
neous angular information or limited information (quad- 
rant information) is within a constant factor of straight-line 
greedy routing. We derive upper and lower bounds on the 
routing delay which are asymptotically tight (in n). 

(ii) We show that even in the case where only a fixed fraction 
of the nodes have routing information, the routing delay is 
within a constant factor of straight-line routing.Thus, this 
implies that for any fixed p £ (0, 1) , we can achieve a delay 
within a constant factor of the optimal strategy. The trade- 
off is that the constant factor scales as =• . 

v 

(iii) In the delay analysis, we adopt a continuum model of a 
sensor network where packets are routed along points on 



the plane, and each hop has a step-size that is bounded by 
M{n). In Section|6| we validate the analytical results us- 
ing simulations where the discretization effects due to node 
locations are accounted for. 

(iv) For networks with progressive routing strategies, we show 
that although hot spots might occur, they are not severe 
enough to reduce the throughput-capacity in an order-wise 
sense. 

We comment that for the strategies considered, suppose that 
we had a deterministic progress toward the destination, then it is 
easy to see that the routing delay 2 will be order-wise equivalent 
to straight-line routing. For example, in Figure 0b)(i), a packet 
from source 'S' to destination 'D' is routed such that the packet's 
location at each subsequent hop lies in a sector oriented toward 
the final destination in a manner such that there is a deterministic 
lower-bound on the progress toward the destination. This leads to 
an appropriate deterministic upper-bound on the routing delay. 

However, if a deterministic positive step does not occur, (as 
in FigureQb)(ii)), then it is possible that the delay is significantly 
larger. It is reasonable to expect that if the expected distance is 
positive (as in (ii)), we should expect the delay to be order-wise 
equivalent to straight-line routing, with a proportionality constant 
equal to the inverse of the mean distance traveled in every jump. 
Indeed, this would be true if the progress toward the destination 
in subsequent hops were independent and identically distributed 
(i.i.d.), or such that some form of the law of large numbers were 
satisfied. However in our case, the progress (the difference be- 
tween \SD\ and \AD\ in Figure 0b)(ii)) at subsequent hops are 
neither independent, nor identically distributed. In fact, the mean 
progress gets smaller as we proceed towards the destination and 
the sequence is correlated. We show that even under these circum- 
stances, we can upper and lower bound the projections of subse- 
quent steps by a sequence of i.i.d random variables, and use these 

1.1. d. bounds to derive asymptotically tight bounds on the routing 
delay. 

1.2. Related Work 

There has been considerable interest in greedy geographic routing 
and the associated recovery mechanisms to route around dead-ends 
II 3111 2III5III6I . as well as its applications 1211 . 

The idea that approximate information may be sufficiently far 
away from the destination has been explored in the context of mo- 
bile ad hoc networks. In |20], the authors propose the Fish-eye 
state routing, where nodes exchange link state information with a 
frequency that depends on the distance from the destination. The 
idea that nodes far away from the destination requires less precise 
information has been exploited in |3|, where the authors propose 
lazy update mechanisms for routing tables. In |6|, the authors ex- 
ploit such an effect in the context of mobile nodes to propose Last 
Encounter routing, where mobile nodes remember their last en- 
counter time and location with other nodes. They show that with 
sufficient mobility, such schemes result in a performance that is 
within a constant factor of the best-case routing. In the context 
of geographic routing, 1 2 1 have proposed a routing protocol where 
a set of embedded (circular) geographic routing zones are defined 
about the destination. In each zone, a packet travels along a greedy 

2 However, as discussed earlier, it is not clear even in this case if the 
throughput-capacity is unchanged in an order sense. We prove in SectionlTl 
that the throughput-capacity does not decrease in an order sense. 




Fig. 2. Routing as a hopping process 



In this paper, we study routing behavior with limited information 
in the large n regime (i.e., n — + oo). From results in [7 8], such 
a scaling of the radius (equivalently, the peak transmission power) 
leads to a sensor network with n randomly placed nodes being 
(asymptotically) connected. Further, from results in |8][23|, for 
K large enough (but finite), this scaling ensures that straight line 
routing (greedy geographic routing) is possible without recourse to 
face routing (the "loop-around" strategy employed when straight- 
line routing fails due to dead ends). 

For each point 'A', we define its neighborhood set as the col- 
lection of points 



path toward the center of the next-level zone (a tighter circle about 
the destination). When it enters the next level zone, a course cor- 
rection occurs, and the packet is routed in a greedy manner toward 
the center of the next-level zone. Thus, as the packet gets closer to 
the destination, more detailed information is available, leading to 
a sequence of course corrections. Using simulations, the authors 
have shown that such a scheme is a bandwidth-efficient routing 
protocol for large-scale networks. 

In 1181 , the authors formulate the local topology knowledge 
needed for optimal energy efficient geographic routing using an 
integer linear program, and propose Partial Topology Forwarding 
Routing. Related work also includes geographic routing with lo- 
calization errors (where a node does not know its own position 
precisely). In 1 10 1, the authors show using simulations that local- 
ization errors of less that 0.4 times the transmission radius does 
not impact the performance of greedy forwarding in geographical 
routing. In |22|, the authors study the effect of localization errors 
on face routing. They first derive failure modes with localization 
errors (such as routing loops, cross links, and excessive edge re- 
movals). Next, using simulations, the authors in |22| argue that 
even a 10% localization errors can significantly impact the per- 
formance of face routing (perimeter routing). However, when the 
sensor network size is large, it has been shown in [8| l23l that with 
high probability, greedy routing will succeed (i.e., recovery mech- 
anisms such as face routing will be required with small probabil- 
ity). In this paper we study such large-scale sensor networks, and 
analyze the performance of randomized-geographic routing algo- 
rithms with limited information. We show that the delay with such 
schemes is asymptotically (order-wise) equivalent to straight-line 
(greedy) routing. Using simulations, we finally validate our anal- 
ysis. 

In Section|2| we describe the system model. In Sections 13141 
and [5] we derive the delay asymptotics for routing with impre- 
cise and limited information. In Section|6| we present simulation 
results. Finally, in Section Q we derive the achievable through- 
put for progressive routing schemes and show that it is order-wise 
equivalent to the upper bound on throughput capacity. 

2. SYSTEM DESCRIPTION 

We consider a unit region over which sensor nodes are deployed. 
All nodes are assumed to have the same (maximum) transmission 
range and can transmit to any node within its transmission radius. 
The transmission regions are assumed to be circular. For a fixed 
K > 1, We suppose that the common transmission range for all 
the sensors is 



M{n) = K^{\ogn/n) (1) 



A M (n) = {Xell 2 : \XA\ < M(n)}, (2) 

where \XA\ is the Euclidean distance between 'X' and A'. 

In this paper, we ignore the discretization effects due to node 
position (see also [ 9 ] for a similar model). In other words, suppose 
that a packet at location A' needs to be transmitted using geo- 
graphic routing to the destination at location 'O' as in Figure |2| 
Then, we assume that at the next hop, the packet is routed to the 
point 'Z' in Figure|2| For instance, suppose that the network is a 
grid network with n nodes over the unit square (i.e., -4^ distance 
between nodes). Then, in practice, straight-line routing would lead 
to the packet at A' being routed to the node closest to the point 'Z' . 
In this paper we ignore this discretization error, as this asymptoti- 
cally vanishes (the error is at most ^7= , whereas the transmission 

radius is K^J (logn/n), which is order-wise larger). As another 
example, suppose that we employ a random routing scheme, where 
the packet at A' needs to be routed to a randomly chosen node in 
the sector 'FAC (see Figure^- Then, we assume that the packet 
is forwarded to a point 'B' whose location is uniformly distributed 
over this sector. To summarize, we adopt a continuum model of a 
sensor network where we route along points on the plane, and each 
hop has a step-size that is bounded by M(n) . In Section|6| we vali- 
date the analytical results we derive in this paper using simulations 
where the discretization effects are accounted for. 

We employ a two-tier routing model in this paper. We consider 
an e(n) ball about the destination (see Figure |2j. When a packet 
is within this e(n) ball (which is arbitrarily close to the destina- 
tion, as n increases), we assume that nodes have sufficient routing 
information to employ straight-line routing. However, for nodes 
outside this e(n) ball, we consider various routing strategies with 
limited information. 

Our objective in this paper is to quantify the amount of routing 
resources required over the network (for instance, the metric could 
be the average number of bits per node in the network to route from 
a randomly chosen origin to the destination). Physically, the ball 
around the destination corresponds to e(n) destination-location ad- 
vertisement, which is order-wise negligible in the sense that the 
number of nodes in this ball is vanishingly small when compared 
to the total number of nodes in the network. We also require that 
the destination ball is larger than each hop step size to overcome 
edge effects. Thus, we choose a suitable ball size of e(n) = rT x ^ 
in this paper (we use the parameter 1/4 for notational convenience; 
our proofs work for any radius that is order-wise larger than each 
hop step-size). 

With this setup, let us define Y (£) to be the Euclidean dis- 
tance traveled towards the destination in the i th step (and when 
the transmission range is M(n)), by following the relay strategy 



7T. We define the routing delay r(n) for this strategy as follows 

3 

r(n) = sup {j : £ Y (n) (;) <d- e(n)}, (3) 
i=i 

where d is the Euclidean distance between the source and the des- 
tination. Thus, r(n) represents the hitting time corresponding to a 
path entering the e(n) ball, when the transmission radius is M(n) . 
We say that a routing strategy n has an order-wise straight-line 
routing delay if the random variable r„-(n) = 6( M | n j ), as this is 

(order-wise) 3 the number of steps required for deterministic jumps 
of size M(n) to reach the destination a unit distance away along a 
straight-line path. 

3. ANALYSIS OF ROUTING WITH SECTOR 
INFORMATION 

In this section, we consider the situation where all nodes know the 
destination location perfectly, but have imprecise GPS information 
about their positions. This error in position contributes to an an- 
gular error in the direction of the destination. Hence, when the 
node wishes to transmit, the choice of neighbor is not along the 
correct direction to the destination, but in a sector within angles 
[</>l,</>2] corresponding to the error in angular information. The 
misaligned sector AFC is a sector contained between the angles 
[4>i,<j>2], such that for a randomly chosen point (L, a) from the 
sector, E(L cos a) > 0. 

Consider Figure[3] Let the packet be currently at the point A' 
at the i th step and wish to travel to the destination 'O'. An error 
in location is mathematically equivalent to stating that the next 
hop location is randomly chosen (with an uniform distribution) as 
any point in the sector AFC. The neighbor subset from which we 
choose our relay node is the set A( M ( n ) .^1.^2), where 

A iM ( n )Ah^) — \^ x £ A M(n) : (pi < (ZXA-ZOl) < 02}. 

(4) 

We have assumed that the radial distance of the hop is also ran- 
dom, and not deterministically equivalent to M(n). The random- 
ness in the radial distance (per hop) models a variable power se- 
lection at the node. The analysis in this paper can be directly ex- 
tended to the case when the radial distance is deterministic (or 
any other given distribution). Y^ n '(i) is the Euclidean distance 
traveled towards the destination in the i th step. By definition, 
Y {n) (i) — \OA\ - \OB\. We denote the polar co-ordinates of 
this jump as the pair (L (n) (i), a (n) (i)), where L (n) (i) = \AB\ 
and or n )(i) = ZOAB. Now, let us consider the delay r(n) for 
this routing scheme. The packet's source is A and the destination 
O with \OA\ = 1 for notational simplicity. 

Definition 3.1. We define a random sequence {a(n),n — 1,2,...} 
to be asymptotically almost surely ( a.a.s) bounded by another ran- 
dom sequence {b(n),n = 1,2,...} if3No > such that for all 
n > No, a(n) < b(n) a.s. 

In the rest of the paper, we denote sequences {a(n)} and 
{b(n)} satisfying Definition B.ll bv 

a(n) < b(n) (a.a.s). 

3 We denote g(n) = 0(/(n)), if there exists positive constants ci and 
C2 such that for all n large enough, < c\ < g(n)/f(n) < C2- 



We shall now show in Theorem 13.11 that the delay for this 
scheme is of the order of straight-line routing. To prove Theo- 
rem l3.ll we will need to prove the following Lemma. 

Lemma 3.1 (A Limit theorem for Triangular Arrays). For any 

fixed K > 1, let M(n) — K\J (log n/n). Consider a triangular 
array of bounded i.i.d. (independent and identically distributed) 
random variables X| n ' , 1 < i < n. Then, 

1 

M(n) 

lim M(n) V X}" 5 — > EX{ (a.a.s) 

i=l 

Proof. We have 

1 

p(|M(n) ^ XW-EX\ >e) < 
P[M(n) YZ xM-EX>e) 

i=l 

+P[M(n) x l n) -EX < -e) . (5) 
By the Chernoff Bound, we have 

P[M(n) x l n) ~ EX > e ) < exp~^M /(f) . 

i = l 

where 1(e) is the large-deviations rate function 1 4 1 for the bounded 
random variable. Applying the bound to L.H.S of equation ^5}, 

1 

M(„) 

P[\M(n) Y X< i U) - EX \ > e ) < 2exp"^M /(E) 

i = l 

Also, 

2 exp" I(e) < 00, M(i) = kJ^. 
i=i 1 

Thus, by Borel-Cantelli's Lemma, 

1 

M(n) 

lim M(n) V X^ n) — > EX (a.a.s). 

n — >oo — ^ 

i=l 

□ 

It can be shown that the sequence of random variables {Y^ } 
are not i.i.d., but are history dependent. Thus, we first upper and 
lower bound these random variables by sequence of i.i.d. random 
variables, and a sequence of error terms. 

Lemma 3.2. Let (S, a) be the polar co-ordinates of any point B 
within a circle of radius m, with center A. Let O be any point on 
the plane such that \ OA\ > (m + e). Let e > Then, 

q2 

Scosa < \OA\ - \OB\ < Scosa 



Proof. The proof is presented in the Appendix. □ 

Using the bound in Lemma l3~2l we now derive the main result. 

Theorem 3.1. Let (L, a) be the polar coordinates of a uniformly 
chosen point from a sector within angles [01, 02] and unit radius. 
Let ft = E(Lcos(a)) Then, V positive Ci , C2 : ci < 4 < C2, 



ci / n c 2 / n 

< r(n) < 



log(n) 



(a.a.s).D 



Proof. Recall that y (i) is the distance traveled towards the des- 
tination in the i th step. For a packet located at 'A' at time-step i 
(see Figure and the next hop position being 'B\ our routing 
model implies that Y (n) (i) = \OA\ - \OB\. 

From Lemma l3~2l we have, for all j < r(n), the following 
equations. 



£W(») 2 



}<E y(n) «> (6) 



£ y (n) (*) < £ L (n) (i) cos a (n) (< 



(7) 



Defining L'™- 1 (i) = " L M( .^- > and substituting in equations (6,7), we 



have 



(8) 



M(n)J2L {n) (i) cosa (n) (T)- 

i=l 

( M ^) 2 E^yf < E^ 

i=l ^ ' i=l 



J2Y (n) (i) < M(n)J2L (n \i) cos a <n) (i). (9) 

i=l i=l 

We observe that {L^ (i) cos a'™-* (i)}" =1 are a sequence of i.i.d. 
random variables, with the expected value E{L^ (i) cos (i) } : 
13, such that < [3 < 1. 

Upper Bound: To prove the bounds for the hitting time, let us 
suppose that our claim r(n) < M c ^ VC2 > ^ (a.a.s) is not 
true. Then, there exists a subsequence rik, k = 1, 2, . . . such that 
T"(n&) > M (^ fc ) ■ Note that, from this implies that 



Y Y ( " k) (i) < 1, fc=l,2,... (10) 

1=1 

However, from {S}, (which holds for all j < r(n)), we have 




Fig. 3. A sector with bias. 



almost sure convergence along every subsequence, it follows from 
Lemma b.ll that 



M(n k ) Y L {nk) (i) cos a (nk) (i) -> c 2 (3; c 2 (3 > 1, (12) 
i=i 

as E(L ink) (i) cosa (nfc) (i)) = (3. 



Moreover, since L^ nk ' < 1 and 



M(n) 
e(n) 



0, we have 



2 ^?£(™*)(i) 2 



~ X ^ < M(n t )f £ * (13) 

z = l 



and 



J7777TT 



(14) 



From equations <1 H12ll3l andfT4l we have Iim*.^ £ i=i ^ ( " fc ) 
> 1, which contradicts 1 1 01 . Thus we have shown that r(n) < 
X^y Vc 2 > i (a.a.s). 

Lower Bound: To prove the lower bound, we need this addi- 
tional construction. For each n, let us augment the sequence of 
random variables F'"'(i), 1 < i < n, as follows. Once a packet 
has entered the e(n) ball about the destination, we start a new 
packet from the source to the destination. Thus we define a se- 
quence of random variables Y^ n '(i) for all i. These random vari- 
ables generate the sequence of l/ n ' (i) cos a' n ' (i), Vi. 

Let us assume that the lower bound r(n) > j§hj. Vci < jj 

. Then, there exists 



M(n) 



(a.a.s). is not true. Observe that r(n) > 

a subsequence n k ,K — 1,2,... such that for some r £ (1, ci), 
M{n k )r{n k ) -> r. (15) 



Let W(n) 



M(n) 



We observe that 



r(n fc ) W(n fc ) 
E L (nfc) (i)cosa (nfc) (i) - ^ L (,lfe) (i)cosa (nfc) (j 



M(n fc ) E L (n>!) (i)cosQ ( ™ fc 



(M(n fc )) 2 E 



< e y(nfc) w (ii) 



By substituting X[ n) = L (n) (i) cos a (n) (i) in Lemma |3~T1 and 
noting that almost sure convergence along a sequence implies an 



< \W(n k )-T(n k )\M(n k ). 



(16) 



Thus, we have 



1= ^ r (nfe) (i)< L ink) (i) cos a (nk) (i) < 

i=l i=l 

W(n k ) 

Y L ink \i)cosa { " k) (i) + \W(n k ) -T(n k )\M(n k ). (17) 




Fig. 4. Adversarial Quadrants - where O' and O" correspond to 
the possible "worst-case" directions of the destination. 



Now applying Lemma l3~Tl and equation I15i to 1171 . we get 

W(n k ) 



J2 L (nfc) (i)cosa (nfc) (i) 



r < 1. 



\W{n k )-T(n k )\M(n k ) -> 0. 



(18) 
(19) 



This contradicts our assumption that^^™*' Y^ 1 ^ (i) = 1. Thus, 
by contradiction, we have shown that r(ra) > jjh^ Vci < 
■jj(a.a.s). □ 

Thus, this result implies that for large enough ra, the delay with 
random angular error leads is equal to p^^n) wmcn is clearly the 
same order as that with straight-line routing, with the scaling con- 
stant inversely proportional to the expected value of the projection 
of each step on the line joining the source and destination. 

4. ROUTING WITH QUADRANT INFORMATION 

In the previous section, we had shown that even with GPS error, the 
routing delays were within a constant factor of greedy straight-line 
routing. In this section, we assume that there is some mechanism 
that provides coarse geographic information about the destination, 
such as the quadrant or half-plane in which the destination is lo- 
cated. Under such a scenario, we derive bounds on the routing 
delays. We show that even in an adversarial mode of choosing 
the local quadrants, the routing delay is within a constant factor of 
straight-line routing. Consider the following routing strategy ty 2 
(see Figure[4j- The node 'A' contains a packet at the i th step that 
needs to be routed to the destination 'O' . The strategy adopted is to 
randomly forward the packet to a randomly chosen point 'B' from 
the correct quadrant. 

Further, all nodes need not have the same coordinate system. 
For instance, suppose that 'A' only knows that the final destination 
is locally to the 'North-West' (with respect to its own coordinate 
system). Let us denote the offset between the node's local coordi- 
nate system and the true direction of the destination by a random 
variable k. We will consider two cases: (i) the offset k is assumed 
to be uniformly distributed within the quadrant; and (ii) an adver- 
sarial scenario where « is chosen to be the worst-case at each hop, 
i.e., along the one of the local coordinate axes that minimizes the 
distance traveled toward the destination (see Figured- 
Let the polar representation of 'B' be L 2 (i), a 2 (i). The 
neighbor subset from which the relay node is chosen is given by 



A K 



,M(n) 



= {Node X e A 



M(n) 



k - ~ < Z(XA) < «}. 



We first consider the case where the angle k = AO AC is 
assumed to be uniformly distributed in [0, ^]. This is equivalent to 
picking a node 'B' from a semicircular AFC (cj>\ = 02 = ■§ 
in Figure|3j with a probability distribution 



a 2 



r L 2 .c*2 ' 



z < M,\a 2 \ < -. 



(20) 



As before , let the source be a unit distance away from the des- 
tination. Let us define the hitting time for the path to hit the e(ra) 
ball around the destination as Tzin) in the first scenario (uniform 
k), and Ts(n) in the adversarial scenario. The following theorem 
provides bounds for the hitting time in both these scenarios. 

Theorem 4.1. (i). Uniformly random n: Let (L 2 , a 2 ) be the po- 
lar representation of a point chosen from a semicircular sector 
( < »2 < \) of a unit circle, with a probability distribution 
fL 2 ,a 2 as m t20t . Let (5 2 = E(L 2 cosa 2 ). Then, for all posi- 
< £ < c 6 , we have f J^fa < r 2 (n) < 



tive C5 , C6 



(ii). Adversarial choice of quadrants: Let (L3, 7) be the polar 
representation of a point chosen randomly from a quadrant con- 
taining the destination, where 7 is the angle with respect to the 
local quadrant. Let = E(Lz min(cos 7, sin 7)). Then for all 
positive c : i < c, we have r 3 (n) < ^-yiS^J (a.a.s). 

Proof. ( i). Uniformly random k: Consider any node B in the semi- 
circle AFC in Figure Btb). For any step i < T2(n), we have the 
following bounds for Z^ n \i), the distance traveled towards the 
destination in the i th step. The following bounds are similar to 
equations |6j and Q. 



Z {n) {i) = \OA\ - \OB\ 



(21) 



; («) / 



J2 ' (i) cos a 2 n) ( i) 



< Z in) {i) 



e(n) 

< L ( 2 n \i) cos a { 2 n \i). (22) 

The rest of the proof is analogous to Theorem 13.11 where we sub- 
stitute L ( 2 n) for L {n \ r 2 (n) for r(n), Z {n) for Y {n \a 2 for a and 
(3 2 for /3. The details are skipped for brevity, (ii). Adversarial 
choice of quadrants: From Figure|4la), it is clear that once a node 
'B' is selected, the distance traveled towards the destination is min- 
imized if the destination O was along either O' or O", whichever 
is more unfavorable. Thus, the distance traveled towards the desti- 
nation in the i th step is bounded below by 



min jL< n) (i) cos7 (n) (i) 



e(n) 



L( n) (i)sin 7 ( " ) (i) - L *\> j < Z (n) (i). 

era J 



(23) 



By arguments similar to Theorem l3.ll we can show that T2,(n) < 
c mM ( a - a - s )- n 

The quadrant information can be replaced by half-plane in- 
formation and still lead to a delay that is within a constant fac- 
tor of straight-line routing, if the uniform « assumption is made. 
However, half-plane information is not sufficient for order-wise 
straight-line routing delay in an adversarial scenario. 



5. ROUTING WITH FRACTIONAL INFORMATION 



Now, let us suppose that 



In this section, we consider the case where only a small fraction 
of the nodes have any routing information at all. Most nodes sim- 
ply forward the packet to a randomly selected neighbor. A small 
fraction of the nodes have routing information (either quadrant in- 
formation, or GPS information with errors). Such routing infor- 
mation could be distributed by some gossip mechanism (routing 
table updates) 1 171 1141 . where nodes forward routing information, 
but also could clear routing tables after some time. We do not ex- 
plicitly model the dynamics of such messaging. Instead, we adopt 
the following simple model for routing. 

We assume that each point has routing information (either im- 
precise GPS, or quadrant information) with a fixed probability 
p£ (0, 1), independent of any other event. With probability 1— p, 
the next hop location is uniformly chosen from a circle of radius 
M(n) about the current location (i.e., random routing). In this 
section, we explicitly derive the results only for the quadrant rout- 
ing strategy. Analogous results hold when only a fraction of the 
nodes have imprecise GPS information. With such a strategy, let 
us denote the event that the i th hop location contains quadrant in- 
formation 4 by E(i) . As before, we normalize the distance between 
the source and destination, and denote the routing delay under the 
strategy described above by the random variable r p (n). 

Theorem 5.1. Let 02 be defined as in Theorem 14.71 Then for all 
positive ci , C2 



ci < < C2, we have 



Cl 

K 



log(ra) 



(a.a.s). 



Proof. Let Q {n '(i) be the random distance traveled towards the 
destination in the i th step. Then, 



Q w (i)=Z w (i)l B(j) +/i w («)l ssW 



(")/ 



(24) 



where Z' n ^ (i) is the random distance traveled towards destination 
with a quadrant information strategy, and R} n ' (i) is the distance 
traveled without any information. Let 'B' be the next hop location 
if event E(i) occurs, else let the next hop location be 'Bl'. Let 
(Lip (i), ot£ (i)) and (Sp (i), (i)) be the polar coordinates 
of the nodes 'B' and 'Bl' respectively. Location 'B' is defined 
identically to the next hop location in Section|4j and location 'B 1 ' 
is chosen uniformly from a circle of radius M(n) about 'A' . Let us 
define 

P^(i) = (L«(i) cos - ±J£L}l m + 

{5^(i)cosaf (25) 
e(n) 



T< n) (i) = L^(i) cos a^(t)l E(€) + 

cos 4 n) (i)l a . (1) (26) 
From Lemma lX2l we have the following bound for all i < T p (n). 



P (n) {i) < Q W (i) < T w (i) 



(«)/ 



(27) 



4 For notational convenience, we suppress explicitly showing the depen- 
dence of E(i) on n 



T p (n) < c 2 



M(n) 



(a.a.s.) 



is not true for some C2 > Then, there exists a subsequence 
n k ,k = 1,2,... such that T p (n k ) > jffa- Now, along this 
subsequence, 



P (nk) (i) = M(n k ) {L p nk) ii) cos a { £ k) (i 

i—l i=l 



-M(n k 



L p nk) (i)' 
e(rifc) 



}l Hi) +M(n k ) Yl {S^Hi) cos a { s k) (i) 



(28) 



The terms on the R.H.S of j28j are a triangular array of i.i.d. 
random variables. Noting that Sp (i) cos oig (i) is a symmetric 
random variable with mean zero, the limit of the sum in equation 

els 



M(„ fc ) 



lim V P (nk) (i) = c 2 f3p 



(29) 



which is greater than unity. This contradicts the fact that jj^j was 
smaller than r p (n), as the path has already reached the destination. 
Hence, 

Similarly, using equation ^26> we show that 

□ 



6. SIMULATION RESULTS FOR ROUTING DELAY 

We have so far assumed a continuum model of nodes in the unit 
square. In this section, we account for the discretization effects, 
and simulate the various scenarios discussed earlier. We consider 
a simulation scenario where nodes are placed uniformly ran- 
domly on a unit square. The source is located at [0, 0] and the 
destination at [0.7, 0.7] (such that the Euclidean distance between 
the source and the destination is one). A histogram of the routing 
delays (number of hops) from 150 simulations is plotted, along 
with a sample path for illustration. 

The simulations are for a node density = 1000. The geo- 
graphic greedy routing strategy, where the relay node is the neigh- 
bor node that is closest to the destination shows an almost de- 
terministic path length of 7 hops and the corresponding sample 
path resembles a straight-line path from the source to the destina- 
tion. This simulation corresponds to a "small-scale" network as 
the number of hops with straight-line routing is relatively small. 
The small variations in the path length occur due to the random- 
ness in the node positions. With unbiased sectors (of 60 degrees), 
our simulation results indicate that the average path length is about 
1 1 hops, which is an increase by a factor predicted in Theorem l3.ll 
(the analysis predicts a length of 1 1.01). Routing with biased GPS 



information is considered next, and the sample path shows some 
spiraling (Figure|5Ja)) due to bias in routing, and the average rout- 
ing delay is about 15 hops. The quadrant based routing strategy is 
simulated next in this setup, and the results are shown in Figure|5| 
The sample path is observed to be similar to the sector routing 
case, and the average routing delay of 15 hops is only marginally 
more than the sector routing strategy. Both of these are again close 
to that predicted by our analytical results. Routing with fractional 
information is simulated by assuming that a node contains routing 
(quadrant) information with a probability of p — .35. The sample 
path and the distribution of routing delay are shown in Figure 
The routing path is considerably lengthened as most of the nodes 
do not contain routing information. The average delay in this case 
is approximately 40 hops, which is close to the analytically pre- 
dicted value (42.7 hops). These plots indicate that the random 
routing strategies have delays that are comparable to the greedy 
geographic routing strategy, as predicted by our analysis. 

The simulations are repeated for a larger network with N = 
10000 nodes. The number of hops for a greedy geographic rout- 
ing strategy is about 28 hops, which is about four times as that in 
the previous case. The analogous results for the five routing strate- 
gies are displayed in Figuresl8l ll2l The average fractional routing 
delay is about 120 hops (Figure fT2t b)). which is approximately a 
- factor increase from the routing delay for the quadrant routing 
scheme (which has an average routing delay of 42 hops). The spi- 
ralling drift of a routing scheme with directional bias is also seen 
in Figure fTol 




(a) Sample Path (b) Distribution 



Fig. 5. Biased Sector Routing- Spiraling drift - 1000 nodes 




(a) Sample Path (b) Distribution 



Fig. 6. Quadrant based Routing - 1000 nodes 



7. THROUGHPUT CAPACITY WITH PROGRESSIVE 
ROUTING 

In the previous sections, we had assumed a continuum model of 
a sensor network for the analysis of routing delays. To obtain 




(a) Sample Path (b) Distribution 



Fig. 7. Fractional information: 35% have quadrant information 
-1000 nodes 




(a) Sample Path (b) Distribution 



Fig. 8. Straight Line Routing - 10,000 nodes 



the throughput capacity, we need to consider individual nodes and 
their data-rates. Thus, in this section, we use a discrete node model 
of the sensor network. We assume that the n nodes are randomly 
placed on a unit square, and as before, the transmission radio range 

of the nodes is M(n) = K^J 1 -^ 1 . To avoid technical compli- 
cations due to edge effects, we assume that paths wrap around 
the edges of the unit square. Thus, the distance between any two 
nodes is simply the shortest straight-line path between them (pos- 
sibly with wrap-around). The scaling parameters are the same as 
in the continuum model. We also assume the Protocol Model 1 8 1 
for successful transmissions. 

Definition 7.1. The transmission protocol is called the protocol 
model if the transmission from node A to B is successful ifd{A, B) < 
M(n) and d(Ci, B) > (1 + A)M(n) for all other transmitting 
nodes Ci. 

We consider progressive routing strategies that ensure that at 
each step of the route, the distance to the destination decreases 
by at least SM(n) for some S > 0. For example, routing with 
sector information (considered in Section|3j will lead to progres- 
sive routing if the transmit power exceeds a minimum threshold, 
and the bias is not large. We assume that a routing strategy that 
satisfies this property is used for route setup, and subsequent pack- 
ets in each flow (between a source-destination pair) follows this 
initial path. Further, the routes are independently setup (across 
flows). It has been shown in 1 8 1 that for routing with straight-lines, 
the throughput capacity is Q( ^ nlogn ). f° r the protocol model, 
and the upper bound on the throughput capacity with the protocol 
model is also of this order. However, with the addition of random- 
ness in routing, the capacity of the network could be reduced, as 
discussed in SectionQ The issue of concern is that the longer rout- 
ing paths due to the random strategies might create local hot spots 
(see Figure^a)). We show that, for progressive routing schemes, 




(a) Sample Path (b) Distribution 

Fig. 9. Unbiased Sector Routing - 10,000 nodes 




(a) Sample Path (b) Distribution 



Fig. 10. Biased Sector Routing- Spiralling drift - 10,000 nodes 



such local hot spots do not affect the throughput capacity in an 
order-wise sense. 

Theorem 7.1. Consider a unit square, with n nodes uniformly dis- 
tributed, and n/2 randomly chosen source-destination pairs. Let 
$ be a progressive routing strategy such that in each hop, the Eu- 
clidean distance to the destination is reduced by 5M{n). Then, 
under the Protocol Model, a data rate of 6( . I ) is simulta- 

K \/n log n ' 

neously achievable by every source-transmitter pair with routing 
strategy ty. 

Proof. We provide a sketch of the proof. Consider a uniform tiling 
of the unit square, by tiles of side a(n) — \J The idea of the 
proof is as follows: 

1. We show that each tile is active (i.e., nodes in the tile are 
allowed to transmit) for a fixed fraction of the time, without 
being interfered by transmissions from other tiles. 

2. Observe that with progressive routing, each route could have 
multiple hops in each tile. We prove that an uniform upper 
bound on the number of hops in any tile summed over all 
routes is O(Vnlogn). 

3. Using these results, we show that each route receives a data- 
rate of 9(-r4 ). 

v V n log n 

The above statements are proved in the following claims. 

It is clear from Definition 17.11 that if there is a transmission 
from a node Ai in some tile, other transmissions in neighboring 
tiles can affect the transmissions of Ai. However, since A is a 
constant, the number J of nearby tiles that can affect the transmis- 
sion is finite. We use this fact to construct a transmission schedule 
that allows for concurrent spatial transmissions. The problem is 
equivalent to a graph coloring problem with each vertex having at 
most a degree of J. Standard results from graph theory indicate 




(a) Sample Path (b) Distribution 



Fig. 11. Quadrant based Routing - 10,000 nodes 




(a) Sample Path (b) Distribution 



Fig. 12. Fractional information: 35% have quadrant information 
-10,000 nodes 



that a graph with a degree no more than J can have all its vertices 
colored by J + 1 colors such that no two neighbors have the same 
color. Thus, we can color the tiles with J + 1 colors such that 
no two interfering tiles have the same color. We can construct a 
schedule such that a given slot is divided into J + 1 sub-slots and 
all tiles of the same color can successfully transmit simultaneously. 

We assume that the strategy allows us to travel a distance of 
at least 8M(n) towards the destination in each jump. Thus, the 
number of hops required to reach the destination (that is a unit 
distance away) for any route is no more than jj^tt hops. Also, 
since the maximum length that can be traveled in any hop is M(n) 
the length of any routing path is upper bounded by j, an order 1 
quantity. 

Claim 1. Given that a routing path passed through a tile, the num- 
ber of hops inside the tile is no more than 

Proof. Assume that the required destination is outside the tile. 
Then, in steps, the packet would have reached closer to the 
destination by more than \f2M{n), which would imply that the 
packet is no longer in the same tile. Even if all the intermediary 
steps fell inside the tile, the number of hops cannot be greater than 
. If the destination was inside the tile, it would have reach the 
destination within steps. □ 

Claim 2. The total number of tiles any path can touch is upper 
bounded by .,}, . . 

Since the total number of hops in any path is at most S M( n ) 
and all these hops can at best be in separate tiles, the claim holds. 

Let be a Bernoulli random variable, with X* — 1 if the 
i th path touched the k th tile. Clearly, X\ is independent of X\ if 
i 7^ j, as the paths are independently routed with respect to each 



other, although and X\ are correlated. The random variable 
Xi is stochastically dominated by an i.i.d Bernoulli process X k 
with 

j^fc _ / 1 w.p a(n) 



for any 



a(n) > 



w.p 1 — a(n) 



maximum number of tiles touched by any path 
Total number of tiles 



This follows from the observation that for any path i, P(Xf = 1) 
, the probability that the k th tile is touched by the i th path is the 
same for any k, as the source/destinations are uniformly distributed 
in the unit square. From symmetry, the probability that a tile is 
touched by a path is equal to the fraction of the tiles touched by 
the path, which is a(n). 

Noting that the number of tiles is M } . a , we have for a(n) — 

Mipl > stochastically dominates X*. We can use the above 
results to provide an upper bound on the maximum number of hops 
H (n) in any tile, which is given by 



H(n) 



V2f vk 
—— max x x 

5 \ k 

i = l 



(30) 



Claim 3. H{n) < almost surely, for /i(n) = 



a/ n log n 



+ 



61 gn^f^. 



Proof. 



log T 



i=l k=l i=l 

log n n 

< nP(^Xf > M (n)) (31) 



The first inequality (a) is a union bound on the probability that 
H(n) > fj.(n) and inequality (b) is due to the fact that X stochas- 
tically dominates X. Let X k = J27=i ■ Now ' from G3, we 
have, for sums of i.i.d bernoulli random variables that 

P(x k > (1 + P)E{X k )\ < e -P 2 ®(x k )/2^ (32) 



We also know that E{X k ) = KV 'y° g " - from previous definitions 



of X. Taking (3 = y 61ogn/£(X fe ) in Equation l32l we get 

P(x fe > £(X fc ) + J&\ognE(X k )) < -r. (33) 



V - V • / fjo 

By summing over all k in Equation Oll we show that for all ^i(n) > 
v "^ og " + ^6 logn^^, we can show that 

p(H{n) > fa)) < ^. 
The almost sure convergence follows by Borel-Cantelli. □ 




Fig. 13. Geometric interpretation of the problem - Sector Informa- 
tion 



We now outline a scheduling strategy for achieving the data- 
rate proposed. Consider a time-slot of fixed length T. We divide 
the time slot into J + 1 slots of duration -Xj and these slots are 
allocated to tiles with the corresponding colors. Within the tile, the 
time-slots are further divided into r(n) sub-slots, where r(n) is the 
number of hops inside the tile. From the previous claims, it is seen 
that r(n) < R^/n log n for a large enough but finite R. Thus, 
we can guarantee that each hop is guaranteed a transmission time 
of - ; - 1 , for every slot of time T. For any source-destination 

Hv)i log n 

pair, all the hops support a data-rate of R / n lo n ', and hence this 
throughput is achievable. □ 

8. CONCLUSION 

In this paper, we have presented geographic routing strategies where 
the nodes have erroneous or limited information about the desti- 
nation location, and have analyzed the asymptotic routing delays 
with such schemes. Our analysis shows that even with limited des- 
tination information (as in quadrant routing) or erroneous angular 
information, the routing delays are order-wise the same as straight- 
line routing. Simulation results indicate that the discretization ef- 
fects due to node locations are small, and there is a good match 
between the simulation results and that predicted by our analysis. 
We also show that for progressive routing strategies that carry the 
packet closer to the destination in each hop, the capacity is order- 
wise the same as a straight-line routing strategy. 



Appendix 

Proof of Lemma l3~21 

Proof. Consider Figure IT3l Let B be any point inside the circle, 
and let (S, a) be the polar representation of the point. It is clear 
from the figure that OB = OA + AB. Now, we have 



\OA\ - \OB\ 



\OA\ 2 



\OB\ 



\OA\ + \OB\ 



(34) 



where | OA j — | OB \ is the distance traveled towards the destination 
O in that jump. Substituting for OB in 1341 . we obtain \OA\ — 
\OB\ 



\OA\- 



]OA\' 2 + \AB\ 2 -2\AB\\OA\cosa] 



\OA\ + \OB\ 

2S\OA\cosa- S 2 
\OA\ + \OB\ 



(35) 



(36) 



When \OA\ - \OB\ > 0, we have 

S 2 ^ 2S\OA\cosa- S 2 .— t, ,„„ N 
Scosa- — < — =5 = \OA - OB . (37) 

OA OA + | OB | 

The inequality in I37i follows from that fact that \OA\ > \OB\, 
and by replacing O A \ + | OB with 2| OA to get the lower bound. 
Next, when |OA] - \OB\ < 0, we have 

25* | OA | cos a-S 2 < 25 , |OA|cosa-5' 2 
2|OA| ~ |OA| + |OB| 

To see this, observe that the RHS of I38i is a negative quantity 
(follows from the equality in <36» . Thus, in order to get a lower 
bound, we replace OA + \OB\ by two times the smaller of the 
two terms, i.e., 2|OA| (because in this case, | OA] < \OB\). 

Thus, as S 2 > and e < OA|, from J36), and GHK we 
have 

Q2 . > 

Scosa- — < OA - \OB\. (39) 

e 

Similarly, for any point inside the circle, we can show that 
| OA | - \OB\ < Scosa. We skip the details. □ 
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